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TEMPLATE-DIRECTED INTERFERENCE FOOTPRINTING 
OF PROTEIN-ADENINE CONTACTS 

5 Fi<Jd nf the Invention 

The present invention relates generally to the study of protein-DN A interactions, and 
more particularly to a technique for inhibiting protein-adenine noncovalent contacts without 
disrupting the secondary structure of DNA, thereby providing a template-directed interference 
technique for precisely studying protein-DNA interactions. 

10 

The study of molecular recognition at the interface between protein and DNA is of 
paramount importance in the study of protein regulation of gene expression. Such protein-DNA 
recognition involves noncovalent interaction (hydrogen bonding, ionic interactions, and nonpolar 

1 5 interactions) between nitrogen bases of DNA and amino acids of protein characterized by great 
stability and specificity. 

A variety of techniques, referred to as DNA footprinting, have been developed to study 
these interactions. These techniques typically involve chemical alteration of DNA to inhibit 
interaction with protein, exposure of the DNA to protein, separation of protein-bound DNA from 

20 unbound DNA, and subsequent DNA cleavage and sequencing, or exposure of DNA to protein 
followed by addition of a chemical moiety that will interact with DNA except where protected by 
protein, followed by cleavage and sequencing. The results of a poly aery Iamide sequencing gel 
can identify the location of protein binding in these procedures via identification of a lack of 
DNA fragments in a molecular weight region that would have been present without protein 

25 binding at a specific location, or presence of bands signifying DNA fragments that are present 
because DNA was not allowed to bind at a specific location. 

Specifically, in "DNA protection footprinting", protein is bound to end-labeled DNA and 
the resulting complex is treated with a reagent that interacts wilh DNA (but not at regions bound 
to protein) to produce chemical lesions that give rise to strand cleavage. The conditions of 

30 treatment are adjusted such that each DNA molecule receives statistically one lesion, thereby 
producing a statistical mixture of singly-modified DNA molecules each including a cleavage site 
at a random position along the DNA chain, but not at the site of protein binding. Subsequent 
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cleavage and gel sequencing of the cleaved, protein-bound DNA, and comparison with a 
sequenced control of DNA treated identically but without protein binding, results in a lack of 
bands associated with protein binding positions. 

"DNA interference footprinting'* involves treatment of end-labeled DNA with a reagent 
that prevents protein binding and provides a cleavage site, to produce a probe of statistically 
singly-modified DNA molecules. The pool of DNA is incubated with DNA-binding protein, 
protein-bound DNA is separated from unbound DNA, and both populations are subjected to 
cleavage conditions. The resulting fragments are separated on a sequencing gel and, by 
comparison of the bands representing bound DNA with bands representing unbound DNA, a lack 
of bands in the protein-bound fraction and presence of bands in the protein-unbound fraction, 
each in a region corresponding to a molecular weight distribution of fragmentation at a particular 
location on the DNA, is indicative of protein binding at that region. 

Other fbotprinting techniques, such as DNase I footprinting, Exonuclease III footprinting, 
hydroxyl radical footprinting, diethyl pyrocarbonate footprinting, KMnO, and OsO< footprinting, 
1 5 ethylation interference footprinting, uranyl photofbotprinting, methylation protection and 
methylation interference footprinting, and missing contact footprinting are known and are 
described by C. J. Larson and G. L. Verdine, "The Chemistry of Protein-DNA Interactions", 
Bioorganic Chemistry: Nucleic Acids, S.M. Hecht, Ed., Oxford University Press, New York, 
1996, pages 324-342. Most of the above techniques do not provide information as to which 
20 groove of DNA, major or minor, is involved in binding. 

An optimal footprinting method for determining specific base contacts at Protein-DNA 
binding sites should possess a number of characteristics, including the following: (1) the method 
should be capable of assaying contacts to all four bases in both the major groove and the minor 
groove of DNA, (2) the structure of the interfering probe moiety should be known, (3) the 
25 interference probe should minimally perturb DNA secondary structure relative to its natural 
counterpart, and (4) the method should be operationally simple. Unfortunately, many of the 
footprinting methods described above fall short of these goals since they involve attachment of a 
chemical moiety to DNA that can affect larger-scale phenomena such as DNA conformation. 

Accordingly, template-directed interference (TDI) footprinting (described also in the 
above-reference article of Larson and Verdine) has been developed to circumvent the chemical 
reagent-based approach. TDI footprinting relics upon the ability to incorporate, in the DNA 
polymer itself, a molecule thai is similar enough to one of the nucleic acids to avoid disraption of 
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DNA secondary structure, that possesses the ability to base-pair as would a normal nucleotide, 
but that disrupts protein-DNA interaction, and is cleavable. TDI footprinting offers significant 
advantage over others of the above-described footprinting methods in that alteration of DNA 
secondary structure is avoided, and DNA and/or protein is not induced to act in a manner 
inconsistent with natural protcin/DNA bonding. 

Although TDI footprinting offers significant advantage in the study of protein-DNA 
interaction, availability of the necessary nitrogen base analogs for the technique is limited 
because of the challenging requirements discussed above, TDI footprinting analogs of guanine, 
cytosine, and thymine have been reported (Hayashibara, K. C, Verdine, G. L., J. Am. Chem. 
Sec, 1991, 1 13. 5104-5106; Hayashibara, K. C, Verdine, Q. L., Biochemistry, 1992, 31, 1 1265^ 
11273;Mascareflas,J. U Hayashibara, K- C, Verdine, G. L.^J.Am. Chem. Sue, 1993, 115, 
373-374, respectively). However, although TDI footprinting has been known since at least 1991, 
and cited extensively, TDI footprinting of the DNA base adenine has remained elusive. 
Significantly, without an available, suitable TDI footprinung analog of adenine, TDI footprinting 
as a technique has been constrained in thai it could not be used to analyze contacts to the base 
surface of the entire major groove of DNA, the principal locus of sequence-specific interactions 
in protein-DNA complexes. 

It is, therefore, an object of the present invention to provide TDI footprinting involving 

adenine. 



20 



Summary »ff th« invention 

The present invention provides a molecule that mimics adenine, in that it can be 
incorporated into DNA without adversely affecting normal base pairing to thymine, but that 
disrupts adenirie-dcpcndant DNA binding to proteins. The molecule find particular uulity in a 
25 method also provided by ihe invention that involves study of protein-DNA interaction. 
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The molecule of the invention has a formula: 



5 




where X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
protein-DNA complex, R is H or a hydrocarbon, and 2 is an oligonucleotide, 2*-deoxyribose-5- 

io 'triphosphate, or a functionality attachable to an oligonucleotide (which can be selected by one of 
ordinary skill in the art). According to another aspect of the invention, an oligonucleotide, such 
as DNA, including at least one nitrogen base of the above formula is provided. 

A method of the invention involves a technique for determining contacts in a protein- 
DNA complex. A plurality of end-labeled DNA strands arc provided, each including, 

15 statistically, a nitrogen base of the above formula. A DNA-binding protein is added to the 

plurality of strands, protein-bound DNA is separated from unbound DNA, and the DNA strands 
are cleaved. The strands then are sequenced to determine a molecular weight region populated 
predominately by the unbound strands, representative of cleavage sites indicating a protein- 
binding site. 

20 The invention also provides a method of determining contacts in a protein-DNA complex, 

involving investigation with TDJ footprinting analogs of all four DNA bases. The technique 
involves providing four groups of DNA. The first three groups include, respectively, a plurality 
of end-labeled DNA strands that each have, statistically, a TDJ footprinting base analog of 
guanine, cytosine, or thymine. The fourth group includes a plurality of end-labeled DNA strands 

25 that each have, statistically, a TDI footprinting base analog of adenine having a formula I, above. 
A DNA-binding protein is added to each group, protein-bound DNA is separated from unbound 
DNA, the DNA strands are cleaved, and the strands arc sequenced to determine a molecular 
weight region populated predominately by the unbound strands and representative of cleavage 
sites indicating a protein binding site. Provision of, and use in TDI footprinting of, TDI 

30 footprinting base analogs of guanine, cytosine, and thymine is described in the above-noted 
articles of Hayashibara, K. C, Verdine, O. U, J. Am. Chem. Soc. t 1991, 1 13, 5104-5106; 
Hayashibara, K. C. Verdine, G. L., Biochemistry, 1992, 31, 1 1265-1 1273; Mascarcnas, J. L., 
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Othcr advantages, novel features, and objects of the invention will become apparent from 
the following detailed description of the invention when considered in conjunction with the 



Brief Descr iption of the Drawing 
Fig. 1 is a photocopy of nondenaturing polyacrylamidc gel electrophoresis (PAGE) of 
mixtures containing a pool of the inventive adenine analog - substituted DNA and various 
i o concentrations of protein. 



Min, C, Cushing, T- Verdine, G., 'Hemplate-DircctedJnterfcrence Footprinting of 
Protein-Adenine Contacts/*, J. Am. Chem. Soc. 1996, 1 18, 61 16-6120; and Min, C. Verdine, G., 

1 5 "Immobilized Meial Affinity Chromatography of DNA," Nucleic Acids Res. 1 996, 24, 
3806-3810, both are incorporated herein by reference. 

The role of hydrogen bonding in protein-DNA interactions is very important, as hydrogen 
bonds normally make up a large fraction of the total number of contacts in a protein-DNA 
complex. Although it is generally accepted thai hydrogen bonds contribute only modestly to the 

20 overall energetics of specific completion, they scrvaa critical role in determining the 

specificity of protein-DNA interactions. Formation of matched Sets of bonds including hydrogen 
bonds to DNA bases of the optimal binding site allows the entire protein to come into intimate 
contact with the DNA surface, permitting the formation of a large number of. energetically 
favorable nonspecific contacts. 

25 The present invention provides an analog of the DNA base adenine, having the 

generalized formula I. as described above. When R is selected as H, either X is selected so as not 
to be amenable to hydrogen bonding (so as not to hydrogen-bond with an adjacent species) or X 
is selected so as to hydrogen bond with R so as to prevent hydrogen bonding between R and an 
adjacent species. When X is selected so as not to be amenable to hydrogen bonding, it is, for 

30 example, selected from moieties other than F, N, and O. When R is selected as a group not 

including a H amenable to hydrogen bonding, X can be a moiety amenable to hydrogen bonding. 
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accompanying drawing. 



retailed Dwrintion of the Invention 
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R and X should be selected, together, so as not to alter the conformation of species I in a 
manner that would prevent species I from participating in normal Watson-Crick bonding. That 
is, they should be selected so as not to stoically interact in a way that would cause species I to 
twist appreciably. 

5 Preferably, R is H or a hydrocarbon group (including cyclic hydrocarbon groups), 

optionally interrupted by hetero groups. As used herein, "hydrocarbon" is meant to include 
alkyl; alkenyl, alkynyl, cycloalkyl, aryl, alkaryl, aralkyl, and the like. Hetero groups can include 
-O-, -CONH-, -CONHCO-, -NH-, -CSNH-, -CO-, -CS-, -S-, -SO-, -(OCr^CH^R (where n=l- 
1 0), -(CFJa- (where n - 1 - 1 0), olefins, and the tike. Most preferably, R is a small moiety such as 

10 H or CH 3 . R should be selected so as not to interfere with hydrogen bonding between H 10 and 
thymine. 

X is a moiety allowing cleavage of double bond 1 2 under conditions tolerable by a 
protein-DNA complex. Those of ordinary skill in the art can routinely select X according to this 
criterion. Typically, X has electron-withdrawing capacity sufficient to render double bond 1 2 

15 readily cleavable by, for example, aqueous base, so that the TDI footprinting technique described 
above, and described in the above-referenced article of Larson and Vcrdine can be carried out. In 
this case X is, preferably, an electron-withdrawing group such as -CN, -Br, -CI, -NO a , -S0 3 ' - 
SO z R, -COOH, -F, or the like. Alternatively, X can be an electron-donating group including 
electron-donating capacity sufficient such that bond 12 can be cleaved by permanganate, 

20 followed by base. In this case, X can be -OCH 3 , -CH 3 ,»-OH, -Ar (aromatic), -NHCOCH 3 , or the 
like. 

R and X can, together, define a connected, or cyclic, moiety, as will be appreciated by 
those of skill in the art. 

According to one, particularly preferred embodiment where R is H, X is -N0 2 and 
25 interacts by hydrogen bonding with R, preventing hydrogen bonding between R and protein. 

The molecule I of the invention satisfies the following criteria. First, as part of an 
oligonucleotide, such as DNA, it does not disrupt secondary structure of the oligonucleotide. 
Second, it is cleavable chemically at double bond 12 so as to find utility in TDI footprinting. 
Third, it disrupts functionality with protein, achievable when the selection of X and R is made 
30 according to the criteria above. Finally, it is amenable to normal base pairing with thymine, that 
is, atoms involved in Watson-Crick hydrogen bonding are not be altered. 
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Rcfcrring now to structure II, Watson-Crick hydrogen bonding between adenine 1 4 and 
thymine 15 in DNA is illustrated. Adenine 14 



5 




includes nitrogen 16 and hydrogen 18, involved in hydrogen bonding with thymine, which 
hydrogen bonding is not disrupted in accordance with the invention. Natural adenine includes as 
well hydrogen 20 and nitrogen 22 (including representative electron cloud 24), both of which are 

! 5 amenable to hydrogen bonding with protein interacting with DNA, at least one of which is 
prevented front hydrogen bonding in accordance with the invention. 

Referring now to arrangement III, adenine 14 and thymine 1 5 arc shown as paired in 
DNA, and a portion 26 of the protein glutamine is illustrated including hydrogen 28 which is 
hydrogen-bonded to nitrogen 22 of adenine, and oxygen 30, hydrogen-bonded to hydrogen 20 of 

20 adenine (see Larson and Vcrdine, "The Chemistry of jWin-DNA Interactions" referenced 
above, page 329). C a 



25 



30 
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It is one of these hydrogen bonds, between hydrogen 28 and nitrogen 22, or between oxygen 30 
and hydrogen 20, that is to be disrupted in accordance with the invention. Preferably, both are 
disrupted. 

5 According to one preferred embodiment of the invention (with reference to formula I), X 

is -N0 2 , and R is H- This molecule, illustrated as formula IV below, allows for hydrogen 
bonding between X and R, namely between oxygen 32 and hydrogen 34 (R), precluding any 
hydrogen bonding between X or R and a protein. 



10 
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The function and advantage of these and other embodiments of the present invention will 
be more fully understood from the examples below. The following examples are intended to 
illustrate the benefits of the present invention, but do not exemplify the full scope of the 
20 invention. References cited in the following example is incorporated herein by reference for all 
purposes. 

Pxample 1 ; Synthesis o f Analog Suirahte for TP! Footprint! no of Maior Groove Contacts to 

Adenine 

25 This and the following examples are directed toward the synthesis and validation of an 

adenine analog suitable for TDI footpriming. A species having the generalized formula (I), 

specifically 7-deaza-7-mtroadenine. 

Materials: 7-Deaza-2 , -deoxyadenosine (2'-deoxytubcrcidin) (Cottam, H. B.; 

Kazimicrrczuk, Z.; Geary, S.; McKeman, P. A.; Rcvankar, G.R.; Robins, R. L., J. Med. Chem. 
30 1985, 28 1452-1467) was obtained from Boehringer Mannheim. Naturally occurring dNTPs 

were from Pharmacia (Piscataway, NJ). Tag polymerase was from Promega (Milwaukee, WI). 

T4 polynucleotide kinase was from Gibco BRL (Gaiihersburg, MD), and glycogen was from 



r ; 
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Boehringer Mannheim (Indianapolis, IN). Sequenasc 2.0 and the Scqucnasc 2.0 kit were from 
United States Biochemical (Cleveland, OH). [y-"P]ATP was from New England Nuclear. Full- 
length 434 repressor was expressed in Escherichia coli using XA90/pRWl90 and purified as 
described (Anderson, J.; Ptashne, M.; Harrison, S.C. Proe. Natl Acad. Set. U.S.A. 1984. 81, 

5 1307-1311). 4 
Synthesis of 4-Ainmo-7-(2^eoxy-3,5-diacetoxy-6-D-eryi hro- 
pentofuranosyl)pyiToIo[2,3-d3pyridimine (2): To a stirred suspension of 4-amino-7-(2-deoxy-6- 
D-eo^^^P^^ft^osyOpoy^^l 2 , 3 ^]?^^^^ (Z'-deoxyiubcrcidin, 1) (Cottam et al„ 
above) in pyridine (1.6 mL, 0.4M) at 0°C under N 2 was added acetic anhydride (0.33 mL, 3.5 

10 mmoL 5.3 equiv). The mixture was allowed to stir overnight, reaching ambient temperature in 
that time. The solution was diluted with methanol (3 mL), stirred briefly, reduced to dryness, 
and was placed on Ihc vacuum line for a few moments. The residue was then subjected to a 
refluxing solution of methanol (3 mL) for 12 h. The solution was again reduced to dryness, 
yielding 240 mg of a crude solid. Chromatography: flash column (20 g of silica) 2% 

1 5 MeOH/CH 2 Cl 2 to 5% McOH/CH 2 Cl 2 . The product was obtained as a white foamy solid. Yield: 
187 mg, 85% »H NMR (300 MHZ) (CDCi 3 relative lo TMS) 6 2.09 (6H, s); 2.48 ()H, ddd, J-2.I, 
4.7, 14.0 Hz); 2.59 - 2.69 (1H, m); 4.27-U7 (3H, m); 5.31-5.34 (1H, m); 5.61 (2H, br ) 
exchanged); 6.41 (1H. d, J=1.0, 3.7 Hz); 6.68 (1H, dd, J=5.7, 8 7 Hz); 7.1 1 (1H, 3, J=3.7 Hz); 
8.29 (1H, s). ,3 C NMR (126 MHZ) (CDC1,):6 20.76, 20.87, 37.44, 64.02, 74.63, 81.54, 83.49, 

20 99.44, 103.89. 120.89, 140.67, 151 .98, 156.88, 170.38* IR (thin film, CDCl 3 ): 3152, 1740. 1734, 
1684, 1653, 1233 cm'. MS (El): m/c (rel intens) 334 (M% 0.1); 275 (0.5); 134(100); 107(0.15). 
El HRMS (CrXM+1): 335.1355 (C^H.^O, requires 334.3328). 

Synthesis of 4- Amino-5-nitro-7-(2-deoxy-3,5Hiiacetoxy-p-D-eo^''''P-- 
pcntofuranosyi)pyrrolo[2^Hi]pyridimine (3): To a vigorously stirred 0'C solution of 2 (97.9 

25 mg, 0.29 mmol, 1 .0 equiv) in CH 2 C1 2 (1 .9 mL, 0.15 M) was added a mixture of concentrated 
H 2 SO, (320 mg) and fuming HN0 3 (260 mg) in a dropwise fashion. After 20 min the ice bath 
was removed and the biphasic solution stirred for an additional 20 min and then neutralized with 
a saturated NaHC0 3 solution until the pH reached 7.0. The mixture was extracted with CTI 7 .CI 2 
(-200 mL), and the bright yellow solution was washed with brine, dried over MgSO«, and 

30 reduced to dryness. Crude yield: 6.33 mg, '0%. The resulting dark yellow solid was used 

without further purification. 'H NMR (300 (CDCI 3 ). 62.09 (3H, s); 2.14 (3H, s); 2.49-2.58 
(1H, m); 2.68 (1H, ddd, J-3.1, 6.0, 17.2 Hz); 4.34 (3H, br s): 5.31 (1H, m); 6.40 (1H, br s, D 2 0 
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exchanged); 6.62 (III, t, J-6.4 Hz); 7.56 (1H hrs D a O exchanged); 8.26 (IH, a); 8.276 (1H, s). 
,J C NMR(126 MHZ) (CDC! 3 ): 620.69, 20.79, 38.82, 63.53, 73.83, 82.62, 84.62, 95.74, 95.78, 
125.15, 129.55, 149.56, 154.12, 156.41, 170.23, 170.30 IR (thin film. CDC1 3 ): 3150, 1740, 1734, 
1684, 1653, 1570, 1522, 1506, 1234 cm'. MS (CI): m/e (reJ intens) 380 (M~ 1, 0.2); 294 (0.60); 
5 236(100). 

Synthesis of 4-Amino-5-mtro-7-(2-deoxy-(M>«iythro-pentofa 
d]pyridiminc (4): To a stirred solution of 3 (1 7.4 mg„ 0.046 mmol, 1 .0 equiv) under N 2 at 0°C 
in methanol (1 .3 mL) was added H 2 CG 3 (1 3 mg, 0. 10 mmol, 2.2 equiv. After 20 min the reaction 
mixture was diluted with brine (25 mL) and extracted with ethyl acetate (1 00 mL). The aqueous 

10 layer was then re-cxtracted with ethyl acetate (3 x 50 mL), and the organic fractions were 

combined, dried over MgS0 4 , and evaporated to yield a bright yellow solid. Chromatography: 
PTLC 5% MeOH/CH 2 -Cl 2 . Yield: 8.8 mg, 65% 'H NMR (300 MHZ) (methanol-d,,): 62.42-2.48 
(IH, m); 2.53-260 (IH, m); 3.73-3.87 (2H, m); 4.03 (1H, t, J=2.9 Hz); 4.53 (1 1 1, dd, J=l 8, 2.9 
Hz); 4.86 (2H, s, D 2 Q exchanged); 4.87 (2H, s,D 2 0 exchanged); 6.57 (1H, dd, J=4.4, 6.2 Hz); 

15 8.17 (IH, d, J=2.5 Hz); 8.70 (1H, d, J-2.5 Hz). 13 C NMR (126 MHz) (methanol-d,): 6 41.73, 
62.75, 72.14, 86.87, 89.22, 96.88, 128.70, 130.12, 150.12, 154.75, 157.96. IR (thin film, 
MeOH): 3445-3335 or, 1684, 1653, 1570, 1522, 1507, 1269 cm '. MS (El): m/e (rel intens) 295 
(M\ 0.1); 200 (0.25); 179 (100). El HRMS: m/e 295.0904 (C u H„r4 f O s requires 295.0917). Mp: 
186-190°C. 

20 Synthesis of 7-deaza-7-nil^o-2'-deoxyadenosine , 5'-Tri phosphate (dA*TP,5): To a stirred 

solution of 4 (4.1 mg, 14 ^mol, J .0 equiv) at 0°C under N 2 in dry trimethyl phosphate (50 //L) 
was added POCl 3 (1.7 fiiL t 18 ^mol), 1.3 equiv). After 15 h a solution of tris- 
(tributylammonium) pyrophosphate (0.14 mL, 1M, in DMF) was added to the reaction flask, and 
stirred overnight (0°C to room temperature). At this time El 3 N (43 j/L) and H 2 0-d 2 (612 jzL) 

25 were added to the flask and stirred for 5 min. The mixture was lyophilized and the residue 

dissolved in 100 AtL of H 2 0-d 2 and passed through an ion exchange column (Scphadex A25, 1 g) 
equilibrated in 0.1 M aqueous triethyl-ammonium bicarbonate (TEAB). The column was 
washed with 50 mL of H 2 0-d 2 and then eluted with 950 mL gradient from 0 to 2M 
triethyiammonium bicarbonate, with 10 mL fractions being collected. Fractions 1 1 -27 were 

30 combined and lyophilized to dryness, and the remaining white powder was rcdissolved in 1 mL 
of TE buffer: 10 mM Tris-HCI (pH 8.0), 1 mM EDTA. This solution was used directly in 
extension reactions with Sequenascs 2.0. The yield was estimated to be 5.7 /*mol (41%) by 
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spectrophotometry analysis using the reported extinction coefficient (JL^ - 365 nm, e=3900) 
for the related compound 5-nitrotubercidin 9 . M P NMR (202 MHz) (D 2 0 referenced to 
phosphoric acid): 6-5.4(d); -6.1(m); -19.5(m), UV(H 2 0): X MI - 356, 364 nm. 

s Example 2: Confirmation of the Ability of the Species of Example } to Promote Ring 
Fragmentation and DNA Strand Scission 

7-deaza-7»nitro-2 < -deoxyadenine (dA*) was synthesized by nitration of 2- 
deoxytubercidin. A sample of dA* was dissolved in 1M aqueous piperidinc at 90 Q C, and 
aliquots were periodically removed and spotted onto a TLC plate. Analysis of the resulting TLC 
10 plate revealed virtually complete disappearance of the dA* chromophobe within 30 min. These 
data indicated that the base moiety of dA* undergoes hydrolytic degradation under the conditions 
used in piperdinc-catalyzcd cleavage of DNA, ihus it is possible to cleave DNA selectively at 
positions containing the analog. 

15 pimple Incorporation of 7-dcaza-7-niiroadenme/dA*\ into DNA 

Preparation of Single-Siranded Template DNA by PCR/IMAC (PCR/metal affinity 
chromatography): Complementary oligonucleotides containing the operator and flanking 
sequences used in the crystallographic study of the 434 repressor (N-tcrrninaJ domain)/0 R l 
complex were synthesized with 3 'PslI -compatible ends, ligatcd into the Pstl site in the 

20 poly linker of the phagemid pBS-t~ (Stratagene), and transformed into £. coll strain XL 1 -blue 
(Stratagene). Insert-positive recominants (designated pBS-434) were verified by sequencing of 
single-stranded phagemid DNA isolated from helper phage VCS-M13 (Stratagene). 
Hayashibara, K.C. PhD. Thesis. Harvard University, Cambridge, MA 1993. 

The polymerase chain reaction followed by immobilized metal affinity chromatography 

25 (PCR/IMAC) was used to generate single-stranded DNA templates containing the operator 
sequence. This technique is reported in a co-pending, commonly-owned United States 
provisional patent application of Verdine, et al., filed July 3, 1996 (Serial No. 60/021,195) and in 
Min, C, Verdine, G,, "Immobilized Metal Affinity Chromatography of DNA," Nucleic Acids 
Res. 1996, 24, 3806-3810, both incorporated herein by reference. In the technique, a moiety that 

30 can coordinate a metal ion is inserted into DNA. Specifically, DNA sequencing primers 
containing six successive 6-hisiaminyipurine residues added onto the 5'-cnd of the DNA was 
synthesized. The PCR/IMAC technique is preferred, although other techniques can be used, 
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such as an asymmetric PCR technique. Alternate techniques are discussed in the Min. ct al. 7 
paper, Nucleic Adda Res., 1996, 24, 3806, referenced above. 

The 6-histarninylpurine moiety was introduced through the convertible nucleotide 
approach by analogy to reported procedures (Fcrcntz, A.E.; Vcrdine, G.L., J, Am. Chem. Soc. 
5 1 99 1 , / / J, 4000-4002; Fcrcntz, A.E.; Vcrdine, G.L. In Nucleic Acids and Molecular Biology; 
Eckstein, F., Lilley, D.M.J. Eds.; Springer- Vcriag: Berlin, 1994; Vol. 8, pp. 14-40. Briefly, 
resin-bound oligonucleotides containing O^phenyl^-deoxyinosinc (4>dl) in place of the dH 
residue were deprntected from the resin by mild ammonia treatment (cone NI^OH, nt, 4b) 
lyophilized to dryness, then treated with 5 M aqueous histamine (55 °C, 14 h) to convert the 4>dl 

10 residues to dH. The crude oligonucleotides were purified by denaturing polyacrylamide gel 
electrophoresis. An additional dG residue was added at the S'-cnd to ensure efficient end- 
labeling using polynucleotide kinase. 

The Htf-tag confers, upon an oligonucleotide, the ability to be selectively and reversibly 
retained on a surface, specifically Ni 2 *-NTA-agarose. The result is a single strand of DNA 

1 5 immobilized at a surface. 

Following removal of the supernatant, which contains unbound DNA, the resin was 
washed and the bound DNA eluted with 200 mM imidazole. 

Amplification was carried out in a 100 PCR reaction mixture containing 50 mM KC1, 
1 0 mM Tris-HCl (pH 8.4), 1 .5 mM MgCI 2 , 0.25 mM (each) dATP, dCTP, dGTP, and TTP, 1 

20 unit of Taq polymerase, 20 ^g of pBS-434 as template* 50 pmol of H 6 - tagged Ml 3 (-48) primer 
[5'-d(GHHHHHHAGCGG ATA AC AATTTC ACAC AGO)-3 '] , H = 6-histaminylpurine, and 50 
pmol of M13 (-47) primer f 5'-dCGCC AGGGTTTTCCC AGTC ACG A)-3 '] . The reaction 
solution was overlaid wjth a 60 ptL mineral oil cap to prevent evaporation and placed in a 
thermal cycler (MJ Research) programmed for 35 cycles set at 94 °C (1 min), 55°C (1 min) and 

25 72°C (1 min). 

The duplex PCR products were resolved by IMAC as follows: 1 mL of TE buffer was 
added to the crude reaction mixture, which was then concentrated to a total volume of 50 yX* 
using a centrifugal dialysis carUide (Centricon 30, Amicon). The DNA solution was transferred 
to an Eppcndorf tube, to which was added 150 fxh of binding buffer (6 M guanidine-HCl, 10 mM 
30 Tris, pH 8.2). This mixture was heated for 5 min at 90 °C. Separately, to a 1 .5 mL Eppcndorf 
tube was added 250 a*L (bed volume) of Ni 2 *-NTA agarose resin and 1 mL of binding buffer at 
room temperature. The hot DNA solution was added to the suspension of resin, and the resulting 
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mixture was mixed for 1-1.5 min by vigorous shaking. The mixture was transferred to an empty 
5 mL fitted column (Qiagen), and the flow-through fraction, which contained the unmodified 
strand, was collected into an Eppendorf tube. After repeated pipetting to ensure complete 
mixing, the unbound fractions were aliquotted into four Eppendorf tubes (300 fxL each) and held 
5 aside for further processing. Next, the resin was washed with 1 mL of washing buffer ( 1 0 mM 
Iris, 5 mM imidazole, pH 8.0), which was discarded. The H 0 -containing strand was then eluted 
in 1-2 mL of 200 mM aqueous imidazole solution, The imidazole eluate was mixed thoroughly 
and aliquotted into four Eppendorf tubes (300 (jL each), to each of which were added 30 fzL of 
100 mM ethanolic 1 , 10-phenanthroline and 30 of 3 M aqueous NaOAc. To each of the 

10 unbound and bound fractions was added 900 fjL of absolute ethanol (stored at -20 °C), and then 
the tubes were vortcxed briefly and chilled for 30 min of powdered solid CO,. The tubes were 
microccntrigued for 30 min at 15000g. The supernatant was removed and the pellet washed with 
200 /uL of 80% aqueous ElOH (-20 °C). Following removal ofrthe ethanol solution, the tubes 
were evaporated to dryness by centrifugal lyophilizatton (SpecdVac, Savant). To each dry tube 

1 5 was added 50 vL of TE buffer, and the DNA concentration was quantified by UV 

spectrophotometry. This procedure typically yields 5-10 pmol of the H<r tagged template and the 
unmodified template (the primers are used in -5-10 fold excess in the PCR reaction). 

Template-Directed Extension: The procedure provides a sufficient template for 20 lanes 
on a DNA sequencing gel. 

20 (a) End Labeling of Primer: A 20 fit volume of 0.2 pmol/^L M13 (-21) primer, 5- 

d(GTAAAACGACGGCCAGT>3\ was end labeled in a 100 fiL volume by adding 10 fiL of 400 
mM Tris*HCl (pH 7), 200 mM MgCl 2 , 20 units of T4 polynucleotide kinase, and 4 ptL of 6000 
Ci/mmol [y- 32 P]ATP. This mixture was incubated at 37°C for 30 min, heated at 90"C for 3 min, 
chilled on ice, and briefly micro centrifuged at 14000 rpm. 

25 (b) Annealing; A 100 fxL volume of the DNA template strand (approximately 0.04 

pmol/^L in TE) was added to the end-labeled primer (100 t*L) and healed to 65 °C for 5 rain, and 
then allowed to reach room temperature while remaining in the bath. The condensate was 
recombined with the annealed primer/template solurion by brief microcentrifugation. 

(c) Primer Extension: A 200 sample /iL of annealed primer/template (4 pmol) was 

3o combined with 70 ^L of polymerization buffer [48 mM Tris-HCl (pH 7), 25 mM MgCl 2 . 200 
mM NaCl, 9mM DTT, and 400 pg/fj-L BSA). The resulting solution was added to 200 of a 
mixture containing 80 /jM each of dCTP, dGTP, and dTTP; and 80 /xM (total nucleotide) 1 :3 
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mixture of dA*TP and dATP; and 50 j*M NaCL. The extension was allowed to proceed at 37°C 
for 20 min and then stopped by addition of 200 of 3 M sodium acciate (pH 7.0), and 
precipitated at -70 °C with 20 of 20 mg/mL glycogen and 1800 of 100% eihanol. The 
pellet obtained after mierocentrifugation at 14000 rpm for 30 min at 4°C was washed with 80% 
5 aqueous ethanol and dried by lyophilization. The resulting DNA was dissolved in 80 ptL of T£, 
4 AtL of which was then used for each subsequent incubation with protein. 

Rvnfn r f ff 4: Ritirfinp Studies Involving Adenine Analog of the Invention Incorporate t^in pisf A 
1 mM CaCl 2 , 100 Atg/mL DSA, and 10% (v/v) glycerol: Protein-bound and unbound 

to DNA were separated using a 5% low-ionic-strength nondenaturing polyacrylamide gel. 

(Chodtsh, L.A. In Current Protocols In Molecular Biology, Ausabei, F.M., Brent, R. Kingston, 
t..E., Moore, D.D., Smith, J.A. Scidman, J.G., Struhl, K., Eds; John Wiley and sons: New York, 
1989; Vol. 2,pp 12.2.1 - 12.2.10). (37:1 monomerpnethylenebisacryl-amide). The gel was 
exposed to BioMAX film (Kodak) for 5 h. Bands were visualized by autoradiography and 

15 excised from the gel using a razor blade. The gel slices were electroeluted into 130 of 10M 
ammonium acetate using a V -channel apparatus (Harvard BioLabs) with 0.5x TBE (45 mM Tris- 
boratc (pH 8.0), 0.5 mM EDTA] as running buffer. The DNA samples in 10 M ammonium 
acetate were precipitated directly by the addition of 3 volumes of 100% ethanol followed by 
precipitation as described above. 

20 Cleavage and Sequencing Procedure. Each 32 PMabeled dsDNA sample was redissolved in 

50 jxL of 1 M aqueous piperidine. This solution was heated at 90 °C for 30 min, cooled to room 
temperature, and micro centrifuged briefly. Samples were lyophilized, twice resuspended in 
30pL of H a O-dj and lyophiiized and then redissolved in 3 of TE and 2 fjL of loading dye 
containing 95% formaminc, 1 0 mM EDTA, 0.025% bromophenol blue, and 0.025% xylene 

25 cyanol. Samples were heated at 90 °C for 2 min and then loaded immediately onto a 7% 

polyacrylamide sequencing gel. The gel was visualized by autoradiography. The generation of 
an authentic sequence by Sanger sequencing made use of the Scquenase 2,0 kit (United States 
Biochcmicals) according to the manufacturer's protocol. A volume of 2.5 fA- of annealed 
p J Pjprimcr;tcmplate mixture was used in each dideoxy termination reaction. 

30 High resolution electrophorelic analysis of the products arising from extension and 

cleavage resulted in a pattern of bands of the A* lanes matching precisely that in the Sanger 
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dideoxy-A sequencing lane, thus indicating that the analog shows the same base-pairing 
preference as its natural counterpart. 

The titration series was fractionated by nondenaturing polyacrylamidc gel electrophresis 
(PAGE) , which separates naked DNA from protein-bound DNA on the basis of differences in 

5 size (Fig. 1). Hie bound and unbound DNA Bands were extracted from the native PAGE gel, 
cleaved with piperidinc, and analyzed on a high-resolution denaturing polyacrylamidc gel. 
Interference was evident in a TDI-A footprint as positions in the sequence at which a band was 
weak or absent from ihc bound lanes and enriched in the unbound lanes, reltive to the A* control. 
Such behavior was clearly observed at one and only one position in the sequence: the 5'-A 

1 o residue of O r 1 - A band at A 1 wais virtually absent from the bound lanes at all protein 

concentrations, thus indicating that binding by 434 repressor was strongly and adversely affected 
by the analog at this site. Conversely, the relative intensity of the Al band in the unbound lanes 
increased as the protein concentration was increased, again suggesting that DNA molecules 
having the analog at position - 1 bind 434 repressor poorly. 

1 5 Those skilled in the an will readily appreciate that all parameters listed herein are meant 

to be exemplary and actual parameters will depend upon the specific application for which the 
methods and apparatus of the present invention arc being used. It is, therefore, to be understood 
that the foregoing embodiments arc presented by way of example only and that, within the scope 
of the appended claims and equivalents thereto, the invention may be practiced otherwise than as 

20 specifically described: 1 
What is claimed is: 
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claims 

1 . A species of a formula; 



5 




10 

where X is a moiety allowing cleavage of double bond 3 2 under conditions tolerable by a 
protcin-DNA complex, R is H or a hydrocarbon, and Z is an oligonucleotide, 2'-deoxyribose- 
5'triphosphate, or a functionality attachable to an oligonucleotide. 

15 2. A species which is an oligonucleotide including at least one nitrogen base having 

a formula: 

x *V H4r W* 



20 




25 where X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
protein-DNA complex, R is H or a hydrocarbon, and Z is the oligonucleotide. 

3. A species as in cither of claims 1 or 2, wherein R is H and X is selected so as not 
30 to be amenable to hydrogen bonding, and X and R are selected so as not to stericalty interact in a 
way that would cause the species to twist appreciably and thereby to prevent the species from 
participating in normal Watson-Crick bonding. 
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4. A species as in cither of claims 1 or 2, wherein R is H, and X is selected so as to 
hydrogen bond with R, and X and R are selected so as not to sterically interact in a way that 
would cause the species to twist appreciably and thereby to prevent the species from participating 
in normal Watson-Crick bonding. 

5 

5. A species as in either of claims 1 or 2, wherein R is a group not including a H 
amenable to hydrogen bonding. 

6. A species as in any of claims 1 -4, wherein R is H or a hydrocarbon group. 

10 

7. A species as in claim 6, wherein R is H or CH 3 . 

8. A species as in any preceeding claim, wherein X has electron-withdrawing 
capacity sufficient to render the double bond adjacent the carbon of the ring to which it is 

15 coupled clevable by aqueous base. 

9. A species as in claim 8, wherein X is -CN, -Br, -CI, -NO,, -SCV.-SOjR, -COOH, 

or-F. 

20 10. A species as in any of claims 1-8, wherein X is -OCH 3 , -CH 3 , -OH, an aromatic 

group, or -NHCOCH,. 

11. A species as in any preceeding claim, wherein R is H and X is -N0 2 . 
25 12. A species as in any preceeding claim, wherein 2 is an oligonucleotide. 



13. A method of determining contacts in a protein-DNA complex, comprising: 
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providing a plurality of end-Iabclcd DNA strands each including, statistically, a 
nitrogen base of a formula: 




to 

where X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
protein-DNA complex, R is H or a hydrocarbon, and Z is an oligonucleotide, 2'-dcoxyribose- 
5 'triphosphate, or a functionality attachable to an oligonucleotide; 

adding DNA-binding protein to the plurality of strands; 
15 separating protein-bound from unbound DNA; 

'cleaving the DNA strands, and sequencing the DNA strands to determine a 
molecular weight region populated predominately by the unbound strands and representative of 
cleavage sites indicating a protein binding site. 

i 

20 14. A method as in claim 1 3, further comprising determining contacts in a protein- 

DNA complex involving investigation with analogs of all four DNA bases, comprising: 

providing a first group of DNA including a plurality of end- labeled DNA strands that 
each have, statistically,' a TDI footprint ing base analog of guanine; 

providing a second group of DNA including a plurality of end-labeled DNA strands thai 
25 each have, statistically, a TDI footpriniing base analog of cytosine; 

providing a third group of DNA including a plurality of end-labeled DNA strands that 
each have, statistically, a TDI footpriniing base analog of thymine; 

providing a fourth group of DNA including a plurality of end-labeled DNA strands that 
each have, statistically, a TDI footprinting base analog of adenine having the formula I; 
30 adding DNA-binding protein to the plurality of strands of each group; 

separating protein-bound from unbound DNA in each group; 
cleaving the DNA strands of each group; and 
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scqucacing the ONA strands of each group to determine a molecular weight region 
populated predominately by the unbound strands and representative of cleavage sites indicating a 
protein binding site. 

5 

15. A method as in cither of claims 13 or 14, wherein R is H and X is selected so as 
not to be amenable to hydrogen bonding, and X and R are selected so as not to sterically interact 
in a way that would cause the species to twist appreciably and thereby to prevent the species 
from participating in normal Watson-Crick bonding. 

10 

16. A method as in either of claims 13 or 14, wherein R is H, and X is selected so as 
to hydrogen bond with R, and X and R are selected so as not to sterically interact in a way that 
would cause the species to twist appreciably and thereby to prevent the species from participating 
in normal Watson-Crick bonding. 

15 

17. A method as in either of claims 13 or 14, wherein R is a group not including a H 
amenable to hydrogen bonding. 

18. A method as in any of claims 13-17, wherein R is H or a hydrocarbon group. 

20 

19. A method as in claim 1 8, wherein R is H or CH 3 . 



20. A method as in any of claims 13-19, wherein X has electron-withdrawing capacity 
sufficient to render its adjacent double bond clcvable by aqueous base. 

25 

21 . A method as in claim 20, wherein X is -CN, -Br, -CI, -N0 2 , -S0 3 , -S0 2 R, - 
COOH.or-F. 

22. A method as in any of claim 13-19, wherein X is -OCII,, -CH„ -OH, an aromatic 
30 group, or -NHCOCH3. 

23. A method as in any of claim 1 3-22, wherein R is H and X is -N0 2 . 
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24. A method as in any of claims 1 3-23, wherein Z is an oligonucleotide, 2- 
dcoxyribose-5'triphosphate, or a functionality attachable to an oligonucleotide. 
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